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RAW SEQUENCE LISTING DATE: 10/07/2004 

PATENT APPLICATION: US/10/019 , 359 TIME: 13:45:04 

Input Set : A:\PTO.FG.txt 

Output Set: N: \CRF4\10072004\J019359 • raw 

<110> APPLICANT: K. U. Leuven Research & Development 
Debyser, Zeger 
De Clercq, Erik 
Cherepanov, Peter 
Pluymers, Wim 

<120> TITLE OF INVENTION: A synthetic gene for expression of a retroviral protein with 



type acitivity in eukaryotic cells 
<130> FILE REFERENCE: K1291-PCT 
<140> CURRENT APPLICATION NUMBER: 10/019,359 
<141> CURRENT FILING DATE: 2002-04-11 
<150> PRIOR APPLICATION NUMBER: EP99201306.0 
<151> PRIOR FILING DATE: 1999-04-26 
<150> PRIOR APPLICATION NUMBER: EP00200171.1 
<151> PRIOR FILING DATE: 2000-01-18 
<160> NUMBER OF SEQ ID NOS : 2 
<170> SOFTWARE: Patentin version 3,3 
<210> SEQ ID NO: 1 
<211> LENGTH: 930 
<212> TYPE: DNA 

<213> ORGANISM: Artificial sequence 
<22 0> FEATURE: 

<223> OTHER INFORMATION: description of artificial sequence 



ENTERED 



synthetic gene encoding 



integrase 
<220> FEATURE: 
<221> NAME/KEY: misc_signal 
<222> LOCATION: (24) . . (30) 
<223> OTHER INFORMATION: Kozak sequence 
<220> FEATURE: 
<221> NAME/KEY: CDS 
<222> LOCATION: (27) . . (899) 
<400> SEQUENCE: 1 

atcactagca acctcaaaca gacacc atg gga ttc ctg gac ggc att gac aag 

Met Gly Phe Leu Asp Gly lie Asp Lys 
1 5 
get cag gag gag cac gag aag tac cac teg aat tgg egg gcc atg gee 
Ala Gin Glu Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala Met Ala 
10 15 20 25 

tec gac ttc aac ctg cca ccc gtc gtc get aag gag ate gtt get age 
Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu lie Val Ala Ser 

30 35 40 

tgc gac aag tgc cag ctg aaa ggc gag get atg cac ggg cag gtt gat 
Cys Asp Lys Cys Gin Leu Lys Gly Glu Ala Met His Gly Gin Val Asp 
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197 
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Output Set: N:\CRF4\10072004\J019359.raw 

63 tgc tct ccc ggc ate tgg cag etc gac tgt act cac ctg gag ggc aag 245 

64 Cys Ser Pro Gly lie Trp Gin Leu Asp Cys Thr His Leu Glu Gly Lys 

65 60 65 70 

67 gtc ate ctg gtc gcc gtg cac gtg gcc tct ggt tac ate gag -get gag 293 

68 Val He Leu Val Ala Val His Val Ala Ser Gly Tyr He Glu Ala Glu 

69 75 80 85 

71 gtc ate cet gea gag act ggc cag gag act gcc tat ttc ctg ctg aaa 

72 Val He Pro Ala Glu Thr Gly Gin Glu Thr Ala Tyr Phe Leu Leu Lys 

73 90 95 100. 105 

75 ctg gee" ggc egg tgg cet gtg aag aca gtg cac aca gat aac ggc tec 389 

76 Leu Ala Gly Arg Trp Pro Val Lys Thr Val His Thr Asp Asn Gly Ser 

77 ' 110 115 120 

79 aac ttc acc tec acc act gtg aag get gee tgc tgg tgg get ggg ate 437 

80 Asn Phe Thr Ser Thr Thr Val Lys Ala Ala Cys Trp Trp Ala Gly He 

81 125 130 135 

83 aag cag gag ttc ggg ate ccc tat aac cea cag tct cag ggc gtg ate 485 

84 Lys Gin Glu Phe Gly He Pro Tyr Asn Pro Gin Ser Gin Gly Val He 

85 140 145 150 

8 7 gaa tec atg aac aag gag ctg aag aag ate ate ggc cag gtt egg gac 533 

88 Glu Ser Met Asn Lys Glu Leu Lys Lys He He Gly Gin Val Arg Asp 

89 155 160 165 

91 cag gea gag cac ctg aag act gea gtg cag atg gee gtg ttc ate cac 581 

92 Gin Ala Glu His Leu Lys Thr Ala Val Gin Met Ala Val Phe He His 

93 170 175 180 185 

95 aac ttc aag ega aag ggc ggc ate ggt ggc tac tea gee ggc gag egg 629 

96 Asn Phe Lys Arg Lys Gly Gly He Gly Gly Tyr Ser Ala Gly Glu Arg 

97 190 195 200 

9 9 ate gtg gac ate ate gee act gac ate cag ace aaa gag ctg cag aag 677 

100 He Val Asp He He Ala Thr Asp He Gin Thr Lys Glu Leu Gin Lys 

101 205 210 215 

103 cag ate ace aag ate cag aac ttc egt gtg tac tac egg gac tec egg 725 

104 Gin He Thr Lys He Gin Asn Phe Arg Val Tyr Tyr Arg Asp Ser Arg 

105 220 225 230 

107 gac cet gtg tgg aag ggc cet gee aag ctg ctg tgg aag ggc gag ggc 773 

108 Asp Pro Val Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly Glu Gly 

109 235 240 245 

111 gcc gtgvjgtc att cag gac aac tct gac ate aag gtt gtg ccc agg cgc 821 

112 Ala Val Val He Gin Asp Asn Ser Asp He Lys Val Val Pro Arg Arg 

113 250 255 260 265 

115 aag gcc aag att ate egg gac tac ggc aag cag atg get ggc gac gac 869 

116 Lys Ala Lys He lie Arg Asp Tyr Gly Lys Gin Met Ala Gly Asp Asp 

117 270 275 280 

119 tgt gtg gcc tct cgt caa gat gag gac taa gtccaactae taaactgggg 919 
12 0 Cys Val Ala Ser Arg Gin Asp Glu Asp 
121 285 290 

123 gatattatga t 930 

127 <210> SEQ ID NO: 2 

128 <211> LENGTH: 290 
12 9 <212> TYPE: PRT 
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INFORMATION: Synthetic Construct 
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L:15 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
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